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1 Introduction 


Sediment toxicity bioassays are often conducted to support the evaluation of 
dredged material [Hoposed for open-water disposal under Section 103 of the 
Marine Protection, Research, and Sanctuaries Act (MPRSA) of 1972 (Public 
Law (PL) 92-532) and Section 404(bXl) of the Federal Water Pollutimi Con¬ 
trol Act of 1972 (PL 92-500), as amended by the Clean Water Act (CWA) of 
1977 (PL 95-217). Bioassays are conducted to evaluate the potential toxicity 
of sediment-associated contaminants such as heavy metals, petroleum hydro¬ 
carbons, and chlorinated organics. Developing diese bioassays requires consid¬ 
erable time and research in a variety of topic areas. Srane bioassays are 
intuitively more developed and more ^propriate for regulatory evduations 
than others. Judging the developmental status of sediment toxicity bioassays 
for the regulatory evaluation of dredged material has been difficult because of 
a lack of developmental criteria. The paradigm described herein helps meet 
that need. 

This report describes a paradigm for developing sediment toxicity bioassays 
for the regulatory evaluation of dredged material. This developmental para¬ 
digm serves several functions. It provides a framework for judging the devel¬ 
opmental status of any sediment bioassay. This permits both scientific and 
regulating persoimel to gauge a test’s completeness any time during its evolu¬ 
tion. It also allows one to compare the relative development of multiple bioas¬ 
says. Describing the general pattern of test development will readily reveal 
gaps in our knowledge. This will permit investigators and program managers 
to optimize limited resources by directing research to areas needing immediate 
attention. Describing a logical sequence for test develofHnent should accelerate 
the development of sediment bioassays still in the conceptual stage. Fmally, 
the latter stages of this paradigm suggest a process whereby die U.S. Arniy 
Corps of Bigineers (USAGE) and the U.S. Environmental Protection Agency 
(USEPA) can incorporate sediment toxicity bioassays into the evaluation of 
dredged material proposed for open-water disposal. This paradigm, therefore, 
serves the important function of interfacing science and public policy. 
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2 Approach 


Initial Guidance (1992) 

Written guidance for judging the developmental status of sediment toxicity 
bioassays did not exist when this project began (1992). Consequently, initial 
input was obtained by telei^ne from about 40 individuals in the scientific and 
regulatory communities. Each person was asked to describe the characteristics 
they would expect to see in a folly developed sediment toxicity bioassay for 
the regulatory evaluation of dredged material. Their input formed the tois for 
initial guidance for what constitutes a folly developed sediment toxicity bioas¬ 
say for the regulatory evaluation of dredged material (Dillon 1992). 


Peer-Review Written Comments and Workshop 
(1993) 

The following year, Dillon (1992) was extensively peer reviewed. Written 
ctmunents were solicited and received from a broad group of individuals. 
Follow-up discussions took place at a workshop held 16-17 June 1993 in Den¬ 
ver, CO. The written comments and workshop discussions were used to for¬ 
mulate the final developmental paradigm for sediment toxicity bioassays 
rqrotted herein. Over 70 individuals provided input during this 2-year project 
(Table 1). 
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Results 


General Evolutionary Pattern of Sediment 
Toxicity Bioassays 

The development of sediment toxicity bioassays usually progresses through 
a series of stages or phases. At each stage, the protocol is modified to reflea 
improvements demonstrated through research and experience. Initially, an 
investigator will propose a bioassay and condua preliminary laboratory 
research. If the bioassay looks promising, other investigators with different 
perspectives and backgrounds may evaluate the bioassay. This independent 
resesffch and development exposes both strengths and weaknesses of the pro¬ 
posed sediment bioassay. Tests that survive this peer scrutiny may become 
widely accepted in the scientific conununity. If so, succeeding activities con¬ 
centrate less on test development and more on performance, e.g., the bio¬ 
assay’s discriminatory power. If a broad consensus develops and sufficient 
research has been conducted, a standard method may be published usually by 
an independent standard-setting organization; e.g., American National Stan¬ 
dards Institute (ANSI), American Society for Testing and Materials (ASTM), 
American Society for Quality Control (ASQQ, in Standard Methods. The 
standardized sediment bioassay test method may then be considered by the two 
Federal ageiKies that have statutory responsibility for the national dredging 
program (USAGE and USEPA). These agertcies evaluate the appropriateness 
of the bioassay from the perspective of managing a major regulatory program. 
This final (rtia% represents tlK interface betw^n science and public policy. 


A Developmental Paradigm for Sediment Toxicity 
Bioassays 

Based on inimt from a wide variety of individuals (see Table 1), a develop¬ 
mental paradigm for sediment toxicity bioassays was created (Table 2). This 
paradigm reflects the general evolutionary pattern of sediment bioassays 
described above, has been extensively peer reviewed, and represents broad 
scierttific consensus. Elements of the developmental paradigm are described 
below. 
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Phase I - Initial development by test proponent 
Phase I Scoping 

Rationale. The test proponent must explain how the proposed sediment 
bioassay will be used in the legul^ry evaluation of dredged material. This 
obviously requires some knowledge of the regulatory milieu. Without a clear 
rationale, considerable resources may be expended developing a test for which 
there is no practical use. For example, is tire bioassay intended to evaluate 
deposited or suspended sediments? Is it desigrred for early tier screening or 
more detailed later evaluations? Will it be used in the ocean or inland disposal 
programs? 

The rationale may describe other a{^lications beyond dredged material 
testing (e.g., field surveys of in situ toxicity and risk assessments). Existing 
bioassays developed for other puiposes must still include a rationale specific to 
the regdatory evaluation of dredged material. 

Assessment/measurement end points. The concept of assessment and 
measurement end points was adopt^ by USEPA while developing its Frame¬ 
work for Ecologies Risk Assessment (USEPA 1992). Assessment end points 
are “formal expressions of the actual environmental value that is to be pro¬ 
tected” (Suter 1990). Assessment end points are identified by managers aiKl 
reflect their decision-makittg environment (e.g., societal concerns). Examples 
might include the following: (a) safeguard local fish and shellfish populations, 
(b) maintain healthy benthic habitat, (c) {sotect threatened and endangered 
species, etc. A measurement end point is “a measurable envirotunental charac¬ 
teristic that is related to the valued draracteristic chosen as the assessment end¬ 
point” (Suter 1990). Measurement end points are identified by tire scientific/ 
technical communities. Ecological models are often used to link measurement 
end points to assessment end points. For example, if the manager wishes ro 
protect a local fishery resource, a population demogr^hic model may be used 
to link observed effects on individual organisms to future population viability. 
Developing assessment and measurement end points represents a dialog 
between the environmental manager and the scientific/^hnical community. 

All too often this dialog occurs only after the technical work has been 
completed. 

The coircept of assessment and measurement end points is used in the 
USEPA/USACE dredging program and has direct application to the develop- 
mem of sediment toxicity bioassays. Federal statutes (Mn(SA and CWA) 
require “no uiracceptable adverse impacts” on the environment as a result of 
digged material disposal. This statutory language is a national assessment 
end point. Several measurement end points are used to evaluate this assess¬ 
ment end point. One of these, sediment toxicity, has traditionally been mea¬ 
sured by determining survival of very sensitive test species after short-term 
exposures. Selection of test end points and test species, therefore, are two of 
the first and most important steps in the development of sediment toxicity 
bioassays. 
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Test end points. I^edged material toxicity bioassays have relied t^avily 
on survival as a test end point. While this will continue to be true, a new 
generation of sediment bioassays with suUethal end points and longer 
(chronic) sediment e;qx>suies is now evolving (Dillon 1993). Although the 
potential number of sublethal end points is very large, the practical number of 
usable end points for regulatory programs is much smaller. Test end points 
must be ecologically relevant, easily understood by the general public, and not 
too difficult to routinely measure. End points must also harmonize with the 
national assessment end point of “no unacceptable adverse impacts.” Repro¬ 
duction and growth are two highly desirable sublethal test end points that meet 
these criteria and enjoy broad scientific consensus (Dillon, Gibson, and Moore 
1990). 

Test species. Selection of the test species is critical to the success of the 
bioassay. Following are some important selection criteria. 

a. Available throughout the year. Sufficient numbers of healthy organ¬ 
isms must be readily available throughout the year either through labo¬ 
ratory cultures or field collections. 

b. Handleable. One must be able to routinely maintain and manipulate 
the test species (including shipping) as required by the laboratory proto¬ 
col. Consistent, acceptable responses in the negative and positive con¬ 
trols must be achievable by contraa laboratories. 

c. Compatible with test media. The habitat, substrate, and nutritional re¬ 
quirements of the test species must harmonize with the test media. For 
examine, infaunal species should be used to evaluate bedded sediments, 
while epibenthic, planktonic, or nektonic species are used with sus¬ 
pended sediments. Grain-size tolerance should be compatible with the 
test material. 

d. Appropriate sensitivity. The test species must not be insensitive to 
major classes of environmental contaminants. 

e. Ecologically important. The biology and natural history of test species 
should document its ecological importance; e.g., carbon flow and nutri¬ 
ent cycling. 

/. Commerciallyfrecreationally important or indigenous. Regional con¬ 
cerns to maintain and protect local populations of biological resources 
may be an important consideration in species selection. 

Literature review. All pertinent information should be documented by the 
test proponent in a thorough review and analysis of the literature. 

Alpha protocol. Development of sediment toxicity bioassays is not unlike 
the develoinnent of computer software. A succession of “new and improved” 
versions appear with continual “debugging,” refinements, and an expanding 
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user group. This paradigm uses similar terminology to connote progressive 
development of sediment bioassays. For example. Phase / Scoping ends with 
a suggested test method called the Alpha protocol. It is the first time the sug¬ 
gested bioassay procedure is put to paper. The Alpha protocol may or may 
not be published. 


Phase I Laboratory research and development 

Statistical design. Statistical design is the a priori description of what 
types of data and analytical methods are required to adequately test a given 
hypothesis. Rigorous experimental research begins with a sound statistical 
design. Important components include but are not limited to the fallowing: 

a. Hypothesis formulation. 

b. Hypothesis testing (data reduction/data analysis). 

c. Level of significance (a error). 

d. Power analysis (p error). 

e. Number of treatments, number of replicates/treatment. 

/. Intralab precision. 

Experimental design and procedure. Experimental design is a detailed 
description of how the statistical design will be implemented and the bioassay 
conducted. It includes but is not limited to the following: 

a. Manipulation of sediment before, during, and after the test. 

b. Manipulation of test species before, during, and after the test. 

c. Physical conditions (temperature, photoperiod, aeration, etc.). 

d. Replicate description (size, number of animals/replicate, etc.). 

e. Feeding regime. 

/. Daily activities (visual observations, water quality, etc.). 

g. Duration of test. 

h. Test initiation/termination procedures. 

Quality assurance (QA)/quality control (QC). QA/QC are the adminis¬ 
trative and technical steps taken to ensure reliable data are generated with 
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specified precision and accuracy. They include but are not limited to the 
following: 

a. Good laboratory practices (GLPs). 

b. Standard operating procedures (SOPs). 

c. Acceptable response in negative controls. 

d. Consistent response in positive controls/use of control charts. 

e. Data audits. 

/. Corrective action procedures. 

Test ruggedness. American Society of Testing and Materials (/ STM) 
(1992a) defines “ruggedness” as the “insensitivity of a test method to depar¬ 
tures from specified test or environmental conditions.” Some “departures” can 
be managed through good laboratory practices, a well-developed QA/QC pro¬ 
gram, and by strict adherence to a published standard protocol. Other aspects 
of test raggedness, however, are more problematic. Nontreatment factors not 
related to sediment-associated contaminants can originate from three sources: 
(a) geof^ysical [MOperties of dredged material, (b) health and condition of the 
test species, and (c) unforeseen deviations in experimental conditions. Some 
potentially important nontreatment factors include the following: 

a. Grain size. 

b. Atnmonia/sulfide toxicity. 

c. Interstitial salinity/hatdness. 

d. Macronutrients and micronutrients. 

e. Sediment manipulation. 

/. Feeding regime. 

g. Water movement (static, static-renewal, flow-through). 

h. Seasonal/reproductive condition. 

{. Acclimation. 

j. Presence of indigenous predators/competitors. 

k. Shipping. 
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Nontreatment factors can bias test results of acute lethality sediment bioas¬ 
says (DeWitt, Ditsworth, and Swartz 1988; Ankley, Katko, and Arthur 1990; 
Word et al. 1991). Their potential influence will increase when chronic suble- 
thal sediment bioassays are more widely used. It is important, therefore, to 
address nontreatment factors during test development. Guidance for eiqieri- 
mental determination of test ruggedness is available (ASTM 1992b). Rugged¬ 
ness may also be evaluated by testing sediments represeming a wide range of 
nontreatment factors. Whatever the approach, results should be summarized as 
a matrix of conditions for which the sediment bioassay is or is not appropriate. 

Dredged material testing. During Phase I, the bioassay should be con¬ 
ducted with samples of dredged material. Sediments should represent a broad 
spectrum of suspected toxicides and geophysical characteristics. Success (or 
failure) of this dredged material testing will be a function of the quality and 
quantity of preceding research and development 

Peer-reviewed publications. The test proponent must communicate 
research results in the peer-reviewed literature. These publications serve sev¬ 
eral fimctions. First, they permit simultaneous access to the test protocol. 

Prior to publication, knowledge of a particular bioassay is anecdotal and gener¬ 
ally limited to informal conununications among colleagues. Acceptance for 
peer-review publication, however, does not necessarily imply broad acceptance 
by the scientific community. 

Second, scrutiny during the peer-review process greatly increases the 
chances that weaknesses in the test method will be uncovered. This is a 
healthy process. Exposing weaknesses does not necessarily disqualify a test. 
On die contrary, it usually leads to significant improvements. At the very 
least, this scrutiny helps define the limits of the bioassay. 

Third, authors of a good, well-written journal article will identify knowl¬ 
edge ga^ and recoriunend important areas for further research and 
development. This discussion promotes scientific debate and stimulates other 
researchers. At this point, the sediment bioassay is beginning to move out of 
its laboratory of origin and into the larger family of research laboratories. 

Beta protocol. After initial research results have been published, the pre¬ 
liminary Alpha protocol may be revised to a Beta version. The Beta version 
may be published as a methods journal article, a technical report, or in some 
other fo- n. It may be prudent to provide informational copies to the 
USACL and USEPA in anticipation of eventual regulatory use. The published 
Beta (notocol is what other li^ratories follow in Phase II. 


Phase II • Evaluation by multiple laboratories 

Continued research and development. The research and development 
described in Phase I is rarely completed by the initial test proponent. 
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Evaluation by multiple laboratories greatly leverages the research effon by 
providing additional resources. 

Interlaboratory studies. Interlaboratory studies are generally conducted to 
determine the statistical variability (precision) among laboratories. Factors 
contributing to this variation include (a) experience and expertise of the opera¬ 
tor, (b) instrument type and calibration, and (c) the environment in which the 
test is conducted. ASTM (1992c) has provided a standard practice for con¬ 
ducting intetlaboratory studies. For sediment toxicity bioassays, interiaboratory 
studies require a well-written test method that can be executed by participating 
laboratories. Great care must be taken to ensure each laboratory is testing the 
same dredged material at the same time. Successful interiaboratory studies 
demaixl considerable resources, committed participants, and proactive project 
management 

Interpretive guidance. Interpretive guidance is the technical information 
that regulating agencies use in judging the importance of bioassay results. 
Interpretive guidance may be based on laboratory experiments, field studies, 
and best professional judgment. For example, if the bioassay end points are 
survival, growth, and/or reproduction, the technical basis might be a calibrated 
population demographic model. Field studies involving contaminant gradients 
can also provide helpful insights for interpreting test results. 

Because interpretive guidance can have several sources, it usually emerges 
only after multiple laboratories have evaluated the sediment bioassay. Inter¬ 
pretive guidance has both a statistical and biological component The former 
includes the power of the test, intralaboratory and interlatoratory variability, 
etc. The latter refers to the biological significance of observed results. For 
examine, if a particular dredged material reduces survival by S percent relative 
to the reference sediment, is that biologically significant? Would a lO-pencent 
decrease be twice as bad or only increment^y injurious? Would a SO-percent 
reductitm represent an order of magnitude increase in toxicity? Providing 
technically sound interpretive guidance for these and other test end points 
represents a significant challenge to the scientific community (Dillon 1993). 

Testing with a wider range of dredged material. As multiple laborato¬ 
ries evaluate the sediment bioassay, the number of dredged materials tested 
will increase. These additional samples should represent a range of toxicities 
and geophysical characteristics. 

Spedes sensitivity to mitior contaminants. Sensitivity of the test species 
to major classes of contamirumts (metals, chlorinated hydrocarbons, petroleum 
hydrocarbons, and pesticides) should be documented. This information can aid 
in species selection and test interpretation. For example, if a particular sedi¬ 
ment is contaminated with heavy metals, one might wish to select a test 
species that is especially sensitive to that class of contaminant. Relative 
contaminant sensitivity may help explain why some sediments are toxic to a 
particular test species, while others sediments are not. Likewise, it may 
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exfdain why exposure to one sediment resulted in significant mortalities in one 
species but not another. 

Contaminant sensitivity information can be especially useful when 
expressed as a dose-response relationship. Klassen (1986) described this rela¬ 
tionship as “the most fundamental and pervasive concept in toxicology.” Its 
centrality is due to the fact that it establishes chemical-specific causality. 
Dose-response data identify what levels of a contaminant are toxic and, impor¬ 
tantly, which levels are not toxic. Sediments are complex mixtures. Some 
knowledge of chemical-specific causality may be insightful or even essential in 
developing technically sound sediment bioassays. In field validation studies, 
causal dose-response data can corroborate correlative field responses observed 
along a contaminant gradient (see Verification/Validation). When combined 
with a knowledge of feeding behavior and microhabitat exposures, chemical- 
specific dose-response data can help distinguish sensitivity to contaminated 
sediments from contaminant sensitivity. Relative dose-response data are also 
e^dally powerful for assessing interspecific differences in xenobiotic metab¬ 
olism for major contaminants. 

Cost and logistics. By this point, the technical community should be aUe 
to document the cost and logistics associated with conducting the Ixoassay. 
Does it require extensive caintol outlay? Is it cheap and easy to run? What 
parts of the test are most difficult? Which procedures require intensive 
mentoring. 

Peer-reviewed publications. As in Phase I, technical results generated in 
the multiple laboratories must be communicated in peer-reviewed publications. 

Acceptance by the scientific community. Scientific acceptance of a 
sediment toxicity bioassay is a primary consideration of the user community. 
There is no written guidance for determining when scientific acceptance has 
been achieved. Rather, a “survival of the fittest” process usually takes place. 
After multiple laboratory evaluation, some Inoassays are utilized with greater 
frequency, while others receive less and less attention. Some disappear from 
use altogether. This is a slow but l^thy process. Qose scrutiny by many 
investigators helps ensure survival of tests that work and are biologically 
meaningful. If ^s process has one weakness, it is detennining when a partic¬ 
ular test has been accepted (or rejected) by the scientific community. Based 
on discussions at the peer-review workshop in Denver, CO, the following 
criteria for scientific acceptance of a sediment toxicity bioassay were agreed 
upon. 


a. Written protocol available. 

b. Technical basis for protocol published in peer-reviewed journals. 

c. Consistently used by multiple laboratories. 

d. Provides interpretalde results of environmental significance. 
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Ganuna protocol. Phase II terminates with a Gamma version of the sedi¬ 
ment bioassay. This upgrade of the Beta version incorporates all experiences 
and research results of multiple laboratories. Again, sending infoimational 
copies to the regulating agencies may be prudent. In some instances, the Beta 
or Gamma protocol may be an ASTM standard guide produced by Subcom¬ 
mittee E47.03 on Sediment Toxicity. ASTM (1992d) defines a standard guide 
as “a series of c^ons or instructions that do not recommend a ^cific course 
of action.” 

By definition, ASTM guides are not step-by-step “cookbooks.” For that, 
ASTM has another end product, the standard test method, which they define as 
“a definitive procedure for the identification, measurement, and evaluation of 
one or more qualities, characteristics, or properties of a material, product, 
system or service that produces a test result.” 


Phase III > Development of a standard test method 

Intertest comparisons. Intertest comparisons evaluate the sediment bioas¬ 
say’s discriminatory power. That is, how frequently and with what precision 
does die test indicate toxicity relative to other sediment bioassays. Most inter¬ 
test studies have found diat no single txoassay is consistently tire most sensi¬ 
tive and precise (Burton et al. 1989; Giesy and Hoke 1989; Long and 
Buckman 1989; Pastorok and Becker 1990). For that reasrai, a frequent rec¬ 
ommendation is to use a battery of sediment bioassays. To be valid, intertest 
studies should be conducted on the same sediment, at the same time, and, 
ideally, side by side in the same laboratory. Unique features of the bioassay 
(e.g., recommended temperature and salinity) should be retained. 

Veriflcation/validation. There are no formal guidelines for 
verifyingA^alidating sediment toxicity bioassays for dredged material testing. 
Comparable guidance, however, can be gleaned from the field of ecological 
modeling (Jorgensen 1988). An ecological model is said to be validated when 
model outputs approximate real world values. That is, the model is predictive. 
Validation occurs near the end of model development, after equation 
calibratitHi, sensitivity analysis, and model verification. Data for model valida¬ 
tion must be collected independently from data used to calibrate the model. In 
addition, the domain of validation data should represent a wide range of forc¬ 
ing functions. 

Verification, on the other hand, occurs early in model development, often 
iteratively with model calibration and sensitivity analysis. Verification is an 
evaluation of the model’s behavior and internal logic. An ecological model is 
said to be verified when “it behaves in the way the model builder wanted it to 
behave” (Jorgensen 1988). Obviously, verification is a very subjective pro¬ 
cess. In contrast, validation is the objective independent evaluation of die 
model’s predictive ctqiability. 
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In the context of dredged material testing, sediment toxicity bioassays are 
simplistic ecological models. As measurement end points, they quantitate 
potential changes in the Federal assessment end point of “no unacceiMable 
adverse impacts" (see Assessment/measurement end points). Verification of 
these ecological models occurs when the sediment bioassay behaves as 
expected. In contrast, validation of a sediment toxicity bioassay has been 
achieved only when it has been shown to be predictive of dredged material 
disposal impacts. In die context of ecological models, verification and valida¬ 
tion are distinctly different terms requiring disparate levels of effort to achieve. 

Verification of sediment toxicity bioassays may be accomplished in at least 
two ways. One is to conduct the bioassay with samples of d^ged material 
r^resenting a range of suspected toxicides. If it responds as expected, it is 
verified. On a smaller scale, one can dilute a toxic sediment and expect to see 
a corresponding response gradient in bioassay results. In either case, the bio¬ 
assay’s state variables (i.e., the test end points) are resptmding as expected to 
increasing forcing fiincdons (i.e., the sediment-associated contaminants). Many 
sediment toxicity bioassays have been verified. 

In contrast, few sediment tnoassays have been validated. To do so, one 
first makes a prediction regarding dr^ged material impacts. This predictitm is 
based on predisposai bioassays. One then monitors the di^sal event to see if 
the model predictions for those sediments were correct. Field validation has 
rarely been attempted for large-scale dredging projects. A more feasible 
approach would utilize mesocosms or small-scale disposal events. Here, vali¬ 
dation of the bioassay prediction ^or a variety of materials could be assessed 
with precision and accuracy. Bioassay response to sediments collected along a 
natural pollution gradient may also provide valuable insights. However, since 
these are correlative observations, cme must be able to exclude the influence of 
covarying parameters and historical events unrelated to present levels of sedi¬ 
ment contaminatiort One way to strengthen these correlative data is to corrob¬ 
orate with chemical-specific causal relationships generated in the laboratory 
(see Spedes sensitivity to mitiot' contaminants). Qearly, further discussions 
on the verification and validation of sediment toxicity bioassays for dredged 
material testing ate warranted. 

Peer-reviewed publications. As in previous phases, results must be com¬ 
municated in peer-reviewed publications. 

Protocol published by standard-setting group, niase III ends with the 
publication of the sediment bioassay as a standard method. Possible outlets 
include ANSI, ASTM, AS(K^, Standard Methods, etc. 


Phase IV - Evaluation by usw groups 

Once a standard method for the sediment bioassay exists, user groups can 
evaluate its utility for the regulatory program. Users include the following: 

(a) the Federal regulating agencies (USAGE and USEPA), (b) the States in the 
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case of Section 404(b)(1) evaluaticxis. (c) the bioassay contracting community, 
and (d) monbers of the private sector seeking permits to dredge. Phase IV 
reiMesems the interface between science/technology and public policy. This 
interfacing occurs on both a national and regional basis. 

Joint agency consideration (USEPA/USACE). Resptmsibility for the 
dredging program is shared jointly by the USEPA and USAGE. The USAGE 
evaluates the material, and the USEPA reviews the evaluation. For Sec- 
ti(m 103 actions, bioassays must be jointly approved by both agencies. Their 
evaluation of a candidate sediment bioassay begins with its scientific merits, 
but includes other less technical issues. As (Niblic servants and custodians of 
Federal monies, the USAGE and USEPA are required to consider and balance 
resource expenditures with benefits received for all Federal actions. They must 
be able to exfdain to the public or, in the case of permitted activities, to the 
private sector precisely why the sediment bioassay is being conducted, what 
information it will yield, and how that information will contribute to decision 
making. Important considerations include but are not limited to the following: 

a. Relevant and i^ropriate for the intended use. 

b. Founded in the applicable laws and regulations. 

c. AccqMed by the scientific community. 

d. Accompanied by sound interpretive guidance. 

e. Demonstrated track record with a range of dredged material. 

/. Gost-effective. 

g. Simplified “cookbook” available. 

h. “Doable" in a routine fashion by contract laboratories. 

1 . Able to sustain judicial review. 

Most sediment bioassays reaching this stage will likely satisfy most agency 
concerns and considerations. If not, the agencies may recommend diat the 
Uoassay be “recycled” to an earlier phase for further research and develop¬ 
ment 

Training with instructional ‘*cookbook.” There is an initial “learning 
curve” whenever a facility attempts a new test. This is true for all laborato¬ 
ries, experienced or novice, and for all sediment bioassays, simple or complex. 
It is hi^y desirable that this learning curve plateau before bioassays are con¬ 
ducted for regulatory decision making. This requires some form of training. 
Training may be in the form of mentoring where a novice investigator learns 
fiTMn one more experienced. Training may also be provided by the regulating 
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agencies. Alternatively, the private sector may find it profitable to offer this 
training as a service. 


Whatever die mechanism, this training should be carried out using a simpli¬ 
fied step-by-step instructional cookbook. This cookbook should be based on 
the standaid test method. However, including all the detailed supporting docu¬ 
mentation may not be necessary to conduct the bioassay in a technically sound 
manner. The cookbook should include standardized formats for recording data, 
QA/QC, reporting test results, etc. Training with a cookbook will help estab¬ 
lish market-based costs for the bioassay and evaluate the contract laboratory’s 
ability to conduct the test. An important measure of laboratory success during 
the training period is routinely meeting or exceeding the performance criteria 
for negative and positive controls. 

Joint agency recommendations. Once training is complete, the 
USEPAAISAOE may recommend the following: (a) incorporation of the sedi¬ 
ment toxicity bioassay into the regulatory program, (b) conditional use based 
on the performance of additional research, or (c) further research and deve¬ 
lopment prior to regulatory use. The clearest sign of joint agency approval is 
indusion in Section 103 and/or Section 404 Implementation Manuals. 

Periodic review. Once incorporated in the regulatory program, perfor¬ 
mance of die sediment bioassay diould be reviewed periodically, lliis review 
will indicate if the test is performing as expected and reveal any unanticipated 
problems. Periodic reviews serve as forums for technology transfer, allowing 
users and the scientific community the opportunity to share experiences and 
exchange information. 
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4 Peer-Review Comments 


The developmental paradigm for sediment toxicity bioassays reported here 
reflects extensive peer-review comments and discussions. It thus represents 
iMoad scientific omsensus. In addition to goieral comments, reviewers were 
asked to critique three specific aspects of the paradigm: completeness, sequenc¬ 
ing, and relative importance of developmental activities. Below is a summary 
of Aeir comments. 


Completeness of the Developmental Paradigm 

All reviewers indicated diat die paradigm was complete. A few reviewers 
even diougbt it was too comfdete. That is, most of the sediment bioassays in 
use today have not been developed to the extott suggested by the paradigm. 
This is true. All sediment Uoassays lie on a developmoital continuum. Some 
ate more ccmiplete than others. This paradigm describes the developmental 
(Mocess as it should occur. It was never intended to set pass-fail criteria for 
qxcific bioassays. 

Sequencing of Developmental Activities 

All reviewers indicated that die sequencing of developmental activities was 
good. A few were concerned that the paradigm gave the false impression that 
researdi and develtqment proceeds sequentially in a fixed linear fashiotL 
Practitioners realize this is not true. Rather, the process is often iterative with 
smne aspects of research and development proceeding in a sequence not cmi- 
sistent itdth that described in the paradigm for others. Some activities, in fact, 
are accomfdished simultaneously. Still, some efforts logically precede others 
(e.g., a written protocol before interlaboratory studies). The paradigm was not 
meant to advocate strict conformity to a particular sequence. It does, however, 
describe a logical progression for developing sediment toxicity bioassays. 

Relative Importance of Developmental Activities 

Peer reviewers indicated that all die developmental activities in die para¬ 
digm were important Givoi that consensus, the question then becomes, to 
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what extent must each be addressed? For example, how much information is 
necessary to address the issue of test ruggedness? How many and what kinds 
of sediment satisfy the requirement “Testing with a Wider Range of Dredged 
Mitterial”? What types of data and study designs are necessary to field verify 
or field validate a s^iment toxicity bioassay? These questions have no simple 
answers? The type and amount of information to address each ttevelopmental 
activity will have to be made on a case-by-case basis in the context of the 
individual sediment bioassay. 


16 


Chapter 4 Peer-Review Comments 









5 Summary 


a. A paradigm for developing sediment toxicity bioassays for the regula¬ 
tory evaluation of dredged material has been established. 

b. This paradigm is (^ased and parallels tl^ evolution of many sediment 
toxicity bioassays; i.e., initial development by the test proponent, peer 
scrutiny by multiple laboratories, consensus on a standard test method, 
and incoiporation into the USEPA/USACE regulatory dredging pro¬ 
gram. 

c. The developmental paradigm has been extensively peer reviewed and 
reflects the input from over 70 scientists and regulators. 

d. Peer-review comments indicate the paradigm is complete and follows a 
logical sequence. Sediment bioassays that have addressed each element 
of the par^gm may be considered developed. However, the amount 
and type of information required for each developmental activity varies 
on a case-by-case basis for each sediment bioassay. 

e. Scientific acceptance of a sediment toxicity bioassay has occuned when 
(a) written protocol is available, (b) the technical basis for the protocol 
has been published in peer-reviewed journals, (c) the bioassay is consis- 
tmitly used by multiple laboratories, and (d) the bioassay provides inter- 
pret^le results of environmental significance. 

/. Technically sound interpretive guidance, especially for chronic sublethal 
sediment bioassays, remains a significant challenge to the scientific 
community. 

g. Clear guidance for what constitutes field validation and/or verificatitm 
of sediment toxicity bioassays is needed. 

h. A period of training is critical if sediment bioassays are to be success¬ 
fully conducted on a routine basis. At the present time, this training is 
accomplished duough informal mentoring arrangements. Institutional¬ 
ized training may reduce the frequency of invalid and/or suspect trioas- 
say results. 
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i. Although designed specifically for dredged material toxicity bioassays, 
this paradigm can be ada{Xed for developing sediment toxicity bioas¬ 
says for many other applications; e.g., bioaccumulation, field surveys of 
in situ toxicity, environmental risk assessments, etc. 
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t > Provided wrinen comments in 1993 on Dillon (1992). 
§ - Woitohop participant in Denver, CO, 16-17 Jun 93. 
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Table 2 

A Paradigm for Developing Sediment Toxicity Bioassc^a for the 
Regulatory Evaluation of Dredged Material 

PtMW I • InUtol Dwetopmtit by T—t Proponant 

Phase I Scoping 

■ Rationale 

■ Assessment/measurement end points 

■ Test end points 

■ Test species 

- Litarature review 

■ Alpha protocol 

Phase I Laboratory Research and Development 

- Statistical design 

■ E](perimental design and procedure 

• QA/QC _ 

■ Test ruggedness 

- Dredged material testing 

■ Peer-reviewed publications 

■ Beta protocol 

Phase II • Evaluation by Multiple Laboratories 

- Continued research and development 

■ Iniertaboratory studies 

■ Interpretive guidance 

■ Testing with a wider range of dredged material 

■ Spedes sensitivity to major contaminants 

■ Cost and logistics 

■ Peer-reviewed publications 

■ Acceptance by the scientific community 

■ Gamma protocol 
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